Abstract

The Major Histocompatibility Complex (MHC, 6p21) codes for traditional HLA and other host response related genes. The
polymorphic HLA-DRB1 gene in MHC Class II has been associated with several complex diseases. In this study we focus on
MHC haplotype structures in the Finnish population. We explore the variability of extended HLA-DRB1 haplotypes in relation
to the other traditional HLA genes and a selected group of MHC class III genes. A total of 150 healthy Finnish individuals
were included in the study. Subjects were genotyped for HLA alleles (HLA-A, -B, -DRB1, -DQB1, and -DPB1). The
polymorphism of the TNF, LTA, C4, BTNL2 and HLA-DRA genes was studied with 74 SNPs (single nucleotide polymorphisms). The
C4A and C4B gene copy numbers and a 2-bp silencing insertion at exon 29 of the C4A gene were analysed with quantitative
genomic real-time PCR. The allele frequencies for each locus were calculated and haplotypes were constructed using both
the traditional HLA alleles and SNP blocks. The most frequent Finnish A~B~DR haplotype, uncommon elsewhere in
Europe, was A*03~B*35~DRB1*01:01. The second most common haplotype was a common European ancestral haplotype,
AH 8.1 (A*01~B*08~DRB1*03:01). Extended haplotypes containing HLA-B, TNF block, C4 and HLA-DPB1 strongly increased
the number of HLA-DRB1 haplotypes, showing variability in the extended HLA-DRB1 haplotype structures. In contrast, the
BTNL2 block and HLA-DQB1 were more conserved, showing linkage with the HLA-DRB1 alleles. We show that the use of
HLA-DRB1 haplotypes rather than single HLA-DRB1 alleles is advantageous when studying the polymorphisms and LD patterns of
the MHC region. For disease association studies, the HLA-DRB1 haplotypes with various MHC markers allow us to cluster
haplotypes with functionally important gene variants, such as C4 deficiency and the cytokines TNF and LTA, and provide
hypotheses for further assessment. Our study corroborates the importance of studying population-specific MHC haplotypes.
Introduction

The Major Histocompatibility Complex, located on chromosome
6p21, has a complex allelic structure with extended linkage
disequilibrium (LD) and polymorphism. The traditional human
leukocyte antigen (HLA) genes encode the cell-surface
antigen-presenting proteins, the HLA molecules, and fall into two major
MHC classes: Class I (HLA-A, -C, and -B) and Class II
(HLA-DRB1, -DQB1, and -DPB1). Many non-HLA genes related to
immune responses, e.g. tumor necrosis factor (TNF), lymphotoxin-alpha
(LTA), complement C4 genes (C4A and C4B) and butyrophilin-like protein
2 (BTNL2), are located in the MHC class III region that resides
between the MHC Class I and II regions [1-3]. The location of
recombination hotspots and the length of LD blocks (genomic
fragments inherited together) are population specific [1,4,5].
Interestingly, it has been shown that the Finnish population has a
distinctive population substructure compared with other
Europeans [6-8].

The HLA genes play a critical role in hematopoietic stem cell
transplantation, and HLA mismatching has been associated with
graft failure and graft-versus-host disease [9,10]. In addition to the
traditional HLA genes, it has been suggested that the HLA-SNP
haplotypes influence the outcome of the transplantation [11].
The association between MHC and diseases has been known for
decades. The polymorphic HLA-DRB1 has been associated with
several complex and infectious diseases, such as acute coronary
syndrome [12], HSV2 related meningitis [13], type I diabetes,
celiac disease and multiple sclerosis (reviewed in [14]). Recently,
genome-wide association (GWA) analyses have increased the
number of MHC gene associations with several drug-induced
hypersensitivity reactions and autoimmune, infectious and
inflammatory disorders [15-20] (GWAS Integrator, http://genome.ucsc.edu).
However, the predisposing HLA alleles are common in the
healthy population, and other genetic and environmental triggers are
required for disease susceptibility [15,21]. Thus, the complexity of
the MHC makes the variant(s) responsible for the causal effect difficult
to pinpoint. As traditional HLA typing is considered rather
expensive and laborious and the analysis of MHC regions is
complex, many studies do not probe MHC associations more
closely. The functionality of the variants therefore remains uncertain,
although identification of a specific risk HLA allele could help to
understand a disease and its causality further.
At present, for transplantation and many diagnostic purposes,
traditional HLA typing remains the most widely used method [14].
De Bakker [22] presented a tag-SNP based HLA-typing method as
an alternative to traditional HLA typing. It has promoted
the screening of certain disease related HLA markers [21,23].
However, not only HLA allele and haplotype frequencies [8,24]
but also the SNP content of haplotypes differ between ethnically diverse
populations, complicating the imputation process. It is clear that
more information on the precise content of HLA haplotypes is
needed for transplantation, disease association, anthropological,
and epidemiological studies [8,25].

In this study we focused on HLA-DRB1 haplotype
structures in a Finnish population. The allelic diversities of the other
traditional HLA genes and a selected group of MHC class III genes
were included and extended haplotypes inferred. We aimed to
interpret the haplotype diversities in relation to the HLA-DRB1 locus
because of its higher degree of polymorphism compared with
the other MHC Class II genes (IMGT database, [26]) and its
well-known associations with several complex diseases. The selected
MHC genes TNF, LTA, C4, BTNL2, and HLA-DRA have all been
associated with several autoimmune or infectious diseases as well.
We hypothesized that the use of longer MHC blocks, rather than
single alleles, could be advantageous when studying the
polymorphisms in the MHC region.

Materials and Methods 

Study Population 

The study consisted of one hundred and fifty (150) randomly
selected healthy Finnish individuals. A description of
the sample set and the DNA extraction has previously been
published by Seppanen et al. [17]. Briefly, samples from 49 males
and 101 females with a mean age of 33.7 years (range 18-60) were
used. The local ethical committee (The Ethics Committee:
Department of Medicine, Hospital District of Helsinki and
Uusimaa) approved the study protocol (Dnro 6/E5/2001,
25.1.2001). All participants provided written
informed consent.

Genotyping of HLA Genes 

The HLA genotyping and/or analysis was carried out in an EFI
(European Federation for Immunogenetics) accredited HLA
Laboratory. The genotyping of the HLA-A, -B and -DPB1 genes was
performed using sequence specific primers (SSP; Olerup SSP AB,
Stockholm, Sweden). The HLA-DRB1 alleles were detected using
sequence based typing (SBT; Invitrogen™, Life Technologies,
Carlsbad, CA, USA). HLA-DQB1 alleles were detected with a
panel of lanthanide-labeled oligonucleotide probes [27]. The
reactions were performed according to the manufacturers' instructions,
giving at least four-digit resolution (for example,
HLA-DRB1*01:01) for HLA-DRB1 and -DPB1, and at least two-digit
resolution (for example, HLA-A*02) for HLA-A, -B and -DQB1. For
resolving ambiguities, high resolution SSPs or SBT HARPs
(heterozygous ambiguity resolving primers) were used. The HLA
alleles were assessed using HLA nomenclature release 3.5.0
(IMGT/HLA database) and carefully interpreted by two persons.

SNP Selection and Genotyping 

We aimed to genotype the LTA, TNF, BTNL2 and HLA-DRA
genes with SNPs. All the selected genes were covered entirely,
including the 5' and 3' flanking regions. SNPs were chosen from the
HapMap database [28] and/or from the public dbSNP database
(http://www.ncbi.nlm.nih.gov/projects/SNP). Information about
the validation status, tagging quality, minor allele frequency
(>0.01) and gene structure was used for selecting the SNPs.
Altogether seventy-four SNPs were chosen from the two gene
regions, TNF~LTA (here referred to as the TNF block) and
BTNL2~HLA-DRA (here referred to as the BTNL2 block).

Table 2. The Finnish A-B-DRB1 haplotype frequencies (>1%) and comparison with other populations [41-46].

Haplotype A~B~DRB1
HLA-A  HLA-B  HLA-DRB1  Finland (n = 150)  Sami (n = 130) [44]  Russia (n = 207) [46]  Ireland (n = 1000) [41-43]
                        f                  f                    f                      f
*01    *08    *03:01    0.040              0.039                0.050                  0.09
*02    *07    *15:01    0.035              0.038                                       0.03
*02    *27    *08:01    0.027              0.014
*02    *15    *13:01    0.023              0.031                0.012
*02    *15    *04:01    0.019              0.023
*02    *15    *08:01    0.019
*02    *13    *07:01    0.016                                   0.021
*02    *15    *15:01    0.013
*02    *44    *04:01    0.012                                   0.019                  0.04
*02    *40    *13:02    0.011
*03    *35    *01:01    0.071              0.031                0.033
*03    *08    *03:01    0.021
*03    *15    *08:01    0.017              0.015
*03    *07    *01:01    0.014                                   0.019
*03    *07    *13:01    0.012
*03    *07    *15:01    0.011              0.042                0.047                  0.05
*11    *07    *15:01    0.015

f = frequency.
doi:10.1371/journal.pone.0079690.t002

SNP genotyping was performed using the Sequenom MassARRAY
iPLEX system (Sequenom, San Diego, CA, USA). In the
iPLEX assay, the SNP alleles are separated based on the
differences of the single base extension (SBE) products. The
manufacturer's instructions were followed to design the assays (AssayDesign
software) and to perform the multiplex PCR and the iPLEX
reaction using 9-10 ng of DNA as a template.

The complement C4A and C4B gene copy numbers and a 2-bp
silencing insertion at exon 29 (CT) in the C4A gene were analysed by
quantitative genomic real-time PCR on a Rotor-Gene 6000 (Corbett
Research, Sydney, Australia) according to Paakkanen et al. [29].
The C4 allotypes were determined by immunofixation [17]. One
subject was excluded from the analysis.

Statistical Analysis 

Allele, phenotype and haplotype frequencies were calculated by
direct counting. Significant departure from Hardy-Weinberg
equilibrium (p<0.001) was tested with Haploview (SNPs) [30] or
ARLEQUIN 3.11 (HLA genes) [31].
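
As an illustration of the two steps named above, the following minimal Python sketch estimates allele frequencies by direct counting and tests a biallelic SNP for departure from Hardy-Weinberg proportions with a 1-df chi-square test. It is not the procedure implemented in Haploview or ARLEQUIN; the function names and the example genotype counts are hypothetical and chosen only for demonstration.

```python
import math
from collections import Counter


def allele_frequencies(genotypes):
    """Estimate allele frequencies by direct counting.

    `genotypes` is an iterable of (allele, allele) pairs, one per individual.
    """
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = sum(counts.values())  # 2n chromosomes
    return {allele: n / total for allele, n in counts.items()}


def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) of Hardy-Weinberg proportions for a biallelic SNP."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele a
    q = 1.0 - p
    observed = (n_aa, n_ab, n_bb)
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value


# Hypothetical example data (not taken from the study).
freqs = allele_frequencies([("A", "A"), ("A", "G"), ("G", "G"), ("A", "G")])
chi2, p_value = hwe_chi_square(25, 50, 25)
```

With the p<0.001 threshold used in the study, a marker would be flagged only when `p_value` falls below 0.001.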

SNP haplotypes were constructed using Haploview [30]. Multi-locus
haplotype frequencies and recombination rates were
estimated from allele data using the Bayesian method implemented in
PHASE v. 2.1.1 [32]. The haplotypes were constructed using all
the selected markers simultaneously. To account for the small
sample size and to exclude unreliable haplotypes, only haplotypes
with a frequency greater than 1% (observed more than 3 times) were used in the
analysis [24,33-35].
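
The frequency cut-off can be sketched as follows. With 150 individuals there are 2n = 300 chromosomes, so a haplotype observed exactly 3 times has a frequency of exactly 1% and is excluded. The function name and the example haplotype labels are hypothetical, and the input is assumed to be a list of already phased haplotypes (for example, the best reconstruction from a phasing program).

```python
from collections import Counter


def common_haplotypes(haplotypes, min_freq=0.01):
    """Return {haplotype: frequency} for haplotypes with frequency > min_freq.

    `haplotypes` holds one entry per chromosome (2n entries for n individuals).
    """
    counts = Counter(haplotypes)
    total = len(haplotypes)
    return {h: c / total for h, c in counts.items() if c / total > min_freq}


# Hypothetical sample of 300 chromosomes (150 individuals).
sample = ["hap_A"] * 21 + ["hap_B"] * 4 + ["hap_C"] * 3 + ["hap_D"] * 272
kept = common_haplotypes(sample)
```

Here `hap_B` (4 copies) is kept and `hap_C` (3 copies, exactly 1%) is dropped.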

The LD measures (D' and r²) were determined using either the
Haploview software (biallelic markers) [30] or the ARLEQUIN
3.11 software (multiallelic markers) [31]. At this step, only one
SNP from each LD block was chosen for further analysis. It has been
shown that for multiallelic loci, D' estimates the strength of LD
(>0.80 strong LD, ~0.5 moderate LD, ~0 weak LD) better than
r² [36]. The R package 'ape' was used to run the neighbor-joining
algorithm according to the method of Saitou and Nei [37].
The HapMap data were used for illustrating the recombination
hotspots in the MHC region [28]. The proxy SNPs (r² > 0.9) for
genotyped SNPs were detected using the SNAP software [38].
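
For two biallelic markers, the two LD measures can be computed from phased haplotype counts as in the sketch below. This uses the standard textbook definitions of D, D' and r² and covers the biallelic case only; it is not the multiallelic procedure of ARLEQUIN. The function name and the example counts are hypothetical.

```python
def pairwise_ld(n_AB, n_Ab, n_aB, n_ab):
    """Return (D, D', r^2) for two biallelic loci from phased haplotype counts.

    n_AB is the number of chromosomes carrying allele A at locus 1 and
    allele B at locus 2, and so on for the other three haplotypes.
    """
    n = n_AB + n_Ab + n_aB + n_ab
    p_A = (n_AB + n_Ab) / n
    p_B = (n_AB + n_aB) / n
    if p_A in (0.0, 1.0) or p_B in (0.0, 1.0):
        raise ValueError("LD is undefined when a locus is monomorphic")
    d = n_AB / n - p_A * p_B
    # D is scaled by its maximum possible magnitude given the allele frequencies.
    if d >= 0:
        d_max = min(p_A * (1 - p_B), (1 - p_A) * p_B)
    else:
        d_max = min(p_A * p_B, (1 - p_A) * (1 - p_B))
    d_prime = abs(d) / d_max
    r2 = d * d / (p_A * (1 - p_A) * p_B * (1 - p_B))
    return d, d_prime, r2


# Hypothetical counts: one of the four possible haplotypes is never observed,
# so D' is 1 although r^2 is well below 1.
d, d_prime, r2 = pairwise_ld(40, 10, 0, 50)
```

The example shows why the two measures are reported side by side: D' reaches 1 whenever one haplotype is absent, whereas r² reaches 1 only when the two markers are fully redundant.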

Results 

Genotyping of HLA-alleles 

The HLA alleles and their frequencies are given in parallel with
the phenotype frequencies in Table 1. HLA allele distributions
were in Hardy-Weinberg equilibrium. A total of 91 HLA alleles
(15, 21, 26, 11 and 18 alleles in HLA-A, -B, -DRB1, -DQB1 and
-DPB1, respectively) were noted. In the Finnish population, two
HLA-A alleles accounted for >66% and five HLA-DPB1 alleles for
>91% of the variation at the loci. Allele frequencies of the other HLA
loci were more equally distributed. These observations are
consistent with previous Finnish studies [39,40].

HLA Haplotypes 

Three-locus haplotypes between HLA-DRB1 and MHC Class I
(HLA-A and -B; Table 2) and MHC Class II (HLA-DQB1 and
-DPB1; Table 3) were constructed. Haplotypes with frequencies
higher than 1.0% are presented in Tables 2 and 3 and compared
with different populations for which the A~B~DRB1 and
DQB1~DPB1~DRB1 haplotypes have been reported in The Allele Frequency Net
Database [41-46]. The three most common A~B~DRB1
haplotypes in the Finnish population were
A*03~B*35~DRB1*01:01 (7.1%), A*01~B*08~DRB1*03:01
(4.0%) and A*02~B*07~DRB1*15:01 (3.5%). The second most
common HLA-DRB1*01:01 haplotype had HLA-B*07 instead of
HLA-B*35. The two-locus haplotypes (DRB1-DPB1,
DQB1-DPB1 and B-DRB1) are presented in Table S1. The
linkage between HLA-DRB1 and HLA-DQB1 was stronger than
between HLA-DRB1 and HLA-DPB1 (Table S2).

Table 3. The Finnish DRB1~DQB1~DPB1 haplotype frequencies (>1%) and comparison with other populations [41-46].

Haplotype DRB1~DQB1~DPB1
HLA-DRB1  HLA-DQB1  HLA-DPB1  Finland (n = 150)  Ireland (n = 250) [41-43]  Greece (n = 246) [45]
                              f                  f                          f
*01:01    *05:01    *04:02    0.059              0.024
*01:01    *05:01    *04:01    0.043              0.028                      0.035
*01:01    *05:01    *02:01    0.040              0.011
*03:01    *02       *01:01    0.049              0.021                      0.014
*03:01    *02       *04:01    0.020              0.098                      0.015
*04:01    *03:02    *04:01    0.026              0.013
*04:01    *03:02    *04:02    0.016
*04:04    *03:02    *02:01    0.013
*04:08    *03:01    *02:01    0.016
*07:01    *02       *04:01    0.023              0.017                      0.014
*08:01    *04       *03:01    0.071
*08:01    *04       *04:01    0.033
*08:01    *04       *04:02    0.014
*09:01    *03:03    *04:02    0.022
*11:01    *03:01    *04:02    0.013
*11:01    *03:01    *04:01    0.012                                         0.036
*11:01    *03:01    *02:01    0.012                                         0.023
*12:01    *03:01    *04:01    0.025
*13:01    *06:03    *04:01    0.027              0.014                      0.016
*13:01    *06:03    *02:01    0.023                                         0.023
*13:01    *06:03    *05:01    0.012
*13:02    *06:04    *03:01    0.030
*13:02    *06:04    *04:02    0.013
*15:01    *06:02    *04:01    0.090              0.149                      0.015
*15:01    *06:02    *04:02    0.020
*15:01    *06:02    *05:01    0.014

f = frequency.
Total number of haplotypes in the population sample (Haplotypes/2n) shown in frequencies (f).
doi:10.1371/journal.pone.0079690.t003

SNP Analysis 

Altogether 74 SNPs were successfully genotyped. The average
success rate was 99% and no discrepancies were observed.
Fourteen SNPs were excluded due to minor allele frequency
(<0.01) or HWE (p<0.001), and five SNPs were excluded as they
were in total LD (r² = 1) with another SNP. Several proxy SNPs
(SNPs in strong LD) can be found for our SNPs (examples
shown in Table S3) [38]. Many of the proxy SNPs have
previously been associated with diseases (http://www.snp-nexus.org/).

A summary of the accepted SNPs (n = 55) is given in Table S4.
The allele frequencies of our genotyped SNPs did not differ
significantly from the HapMap (CEU population, European
descent) [28], except for twelve SNPs in the BTNL2 or HLA-DRA
genes (Table S4). The LD structure of the SNPs (TNF or BTNL2
block) is presented in Figure S1. As expected from the SNP
selection criteria, the pairwise LD between SNPs was always r² < 1.



Previously, using HapMap data [28], high recombination rates
have been observed between the HLA-A and -B loci, upstream and
downstream of TNF, in the BTNL2 region and between the HLA-DQB1
and -DPB1 loci (Figure 1A). Here, the recombination rate
estimation was performed using the HLA alleles (HLA-A, -B,
-DRB1, -DQB1, -DPB1), C4 gene copy numbers and SNPs (n = 55).
Thirty-one SNPs were common to both HapMap and our
study. The highest recombination rate in the
Finnish sample was observed in the BTNL2 promoter region,
corresponding to the HapMap data (Figures 1B and 1C).

Haplotypes of TNF, BTNL2 and C4 

SNP haplotypes were constructed. The rare haplotypes
(observed less than 3 times, i.e. frequency <1%) were excluded
at this point. Nine 13-SNP haplotypes of the TNF block (Table 4) and
twelve 42-SNP haplotypes of the BTNL2 block (Table 6) were observed,
with frequencies ranging from 3% to 21% and 1.3% to 15%,
respectively. In both regions, there seemed to be four to five
common haplotypes of almost equal frequency in addition to the
less common haplotypes. C4A and C4B gene copy numbers were
constructed into haplotypes and the results are shown in Table 8.
As opposed to the TNF and BTNL2 haplotypes, there was one
haplotype that covered more than half of the observed
haplotypes (C4_1 haplotype, 59%). The frequency of the next most
common haplotype was less than 20%, showing a sharp decrease.

Figure 1. Recombination rates in the MHC region. 1A: Recombination rates in the MHC region (HapMap [28]). 1B: Recombination rates in
HapMap [28] for the 31 SNPs that are common to the current study and HapMap. 1C: Recombination rates in the current study for the same
31 SNPs. The highest recombination rate was observed in the BTNL2 promoter region.
doi:10.1371/journal.pone.0079690.g001

The significant pair-wise LD (D' > 0.80, r² > 0.35, P value (P)
<0.05) between HLA-DRB1 alleles and other markers is
presented in Table 10. The HLA-DRB1 allele distribution (%)
presented in Table S2 shows how HLA-DRB1 alleles are clustered
with MHC class I, II and III alleles and MHC blocks.

The linkage between HLA-DRB1 and the TNF block (Table 5)
indicates that a given HLA-DRB1 allele was combined with
different TNF blocks. Two pairs, HLA-DRB1*03:01 with TNF_3
and HLA-DRB1*13:02 with TNF_9, showed strong LD (D' = 0.80 and
D' = 0.88, respectively; Table 10).



In contrast to the TNF block, the linkage between HLA-DRB1 and
the BTNL2 block (Table 7) showed that most HLA-DRB1 alleles
(*01:01, *03:01, *04:01, *08:01, *11:01, *13:02 and *15:01) were
strongly linked with BTNL2 blocks (Table 10, Table S2). However,
there were some exceptions, such as HLA-DRB1*12:01 and *13:01
(Table S2).
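
One simple way to make the contrast between the two blocks concrete is to count how many different blocks each HLA-DRB1 allele is paired with among the two-locus haplotypes above 1%. The sketch below does this with the allele and block pairs transcribed from Tables 5 and 7; the function names are hypothetical, and the count is only a descriptive summary, not the LD statistic reported in Table 10.

```python
from collections import defaultdict

# (HLA-DRB1 allele, TNF block) pairs transcribed from Table 5.
TNF_PAIRS = [
    ("*01:01", "TNF_4"), ("*03:01", "TNF_3"), ("*04:01", "TNF_2"),
    ("*04:01", "TNF_5"), ("*04:08", "TNF_1"), ("*07:01", "TNF_1"),
    ("*08:01", "TNF_1"), ("*08:01", "TNF_2"), ("*08:01", "TNF_6"),
    ("*09:01", "TNF_6"), ("*11:01", "TNF_3"), ("*13:01", "TNF_1"),
    ("*13:01", "TNF_2"), ("*13:02", "TNF_9"), ("*15:01", "TNF_1"),
    ("*15:01", "TNF_4"), ("*15:01", "TNF_7"),
]

# (HLA-DRB1 allele, BTNL2 block) pairs transcribed from Table 7.
BTNL2_PAIRS = [
    ("*01:01", "BTNL2_1"), ("*03:01", "BTNL2_5"), ("*04:01", "BTNL2_3"),
    ("*04:03", "BTNL2_3"), ("*04:08", "BTNL2_3"), ("*07:01", "BTNL2_3"),
    ("*07:01", "BTNL2_12"), ("*08:01", "BTNL2_4"), ("*11:01", "BTNL2_8"),
    ("*12:01", "BTNL2_10"), ("*12:01", "BTNL2_11"), ("*13:01", "BTNL2_6"),
    ("*13:02", "BTNL2_7"), ("*15:01", "BTNL2_2"),
]


def partners_per_allele(pairs):
    """Count the distinct blocks observed with each HLA-DRB1 allele."""
    partners = defaultdict(set)
    for allele, block in pairs:
        partners[allele].add(block)
    return {allele: len(blocks) for allele, blocks in partners.items()}


def mean_partners(pairs):
    """Average number of distinct partner blocks per HLA-DRB1 allele."""
    counts = partners_per_allele(pairs)
    return sum(counts.values()) / len(counts)


tnf_counts = partners_per_allele(TNF_PAIRS)
btnl2_counts = partners_per_allele(BTNL2_PAIRS)
```

For example, HLA-DRB1*08:01 appears with three TNF blocks but with a single BTNL2 block, in line with the more conserved DRB1-BTNL2 linkage described in the text.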

The DRB1-C4 haplotypes behaved like the DRB1-TNF
haplotypes, showing association with different HLA-DRB1 alleles
(Table 9). C4B null alleles (haplotypes C4_2 and C4_3) were found
with DRB1*01:01, *04:01, *08:01 and *13:01. HLA-DRB1*03:01
had C4AQ0 (C4_4; D' = 0.73; Table 10), but a haplotype
without a C4A null allele (C4_1) was also observed. HLA-DRB1*13:02
typically had an insertion in the C4A gene (C4_5; D' = 0.90;
Table 10).

Table 4. TNF blocks (TNF_1-TNF_9) and their frequencies (>1.0%).

SNP         TNF_1  TNF_2  TNF_3  TNF_4  TNF_5  TNF_6  TNF_7  TNF_8  TNF_9
rs2009658   C      C      C      C      G      C      C      G      C
rs2239704   A      C      C      C      C      A      A      C      C
rs2229094   T      T      T      C      C      T      T      C      T
rs2229092   A      A      A      A      A      A      A      C      A
rs1041981   C      A      A      C      C      C      C      C      A
rs1799964   T      T      T      T      C      T      T      C      T
rs1799724   C      C      C      C      C      C      T      C      C
rs1800629   G      G      A      G      G      G      G      G      G
rs361525    G      G      G      G      G      G      G      G      G
rs3093664   A      A      A      A      A      A      A      A      G
rs769178    G      G      G      G      G      G      T      G      G
rs3093553   T      T      T      T      T      T      T      T      T
rs2256965   A      G      G      A      G      G      A      G      G
f           0.210  0.167  0.153  0.130  0.090  0.080  0.057  0.047  0.030

Total frequency: 0.96#

To summarize the LD and haplotype analysis and to illustrate the
polymorphism of the MHC region and its relation to HLA-DRB1,
an additional four-locus haplotype (HLA-DRB1, TNF block,
BTNL2 block and C4; Table 11) and a six-locus haplotype
(HLA-DRB1, HLA-B, TNF block, BTNL2 block, C4A and C4B
allotypes; Table S5) were constructed. The results showed that the
extended HLA-DRB1 haplotypes were broken down when HLA-B,
C4 allotypes and the TNF block were taken into account.



Table 5. Two-locus haplotypes of HLA-DRB1 with TNF block (>1.0%)#.

DRB1 allele  TNF block  f
*01:01       TNF_4      0.103
*03:01       TNF_3      0.076
*04:01       TNF_2      0.022
*04:01       TNF_5      0.027
*04:08       TNF_1      0.016
*07:01       TNF_1      0.040
*08:01       TNF_1      0.027
*08:01       TNF_2      0.048
*08:01       TNF_6      0.030
*09:01       TNF_6      0.025
*11:01       TNF_3      0.014
*13:01       TNF_1      0.017
*13:01       TNF_2      0.035
*13:02       TNF_9      0.026
*15:01       TNF_1      0.075
*15:01       TNF_4      0.016
*15:01       TNF_7      0.014

f = frequency.
# Rare haplotypes were excluded from the analysis.
doi:10.1371/journal.pone.0079690.t005



To further analyze the structure of the TNF and BTNL2 blocks, we
created phylogenetic trees of the genetic distance using sequence
similarities (Figure S2). Three branches of TNF were observed,
of which two, TNF_3 and TNF_9, formed haplotypes with
HLA-DRB1*03:01 and HLA-DRB1*13:02, respectively, which
showed high LD in the analysis above. The third branch divided into
three groups: the first with TNF_2, the second with TNF_4, _5 and _8, and
the third with TNF_1, _6 and _7. The C4BQ0-related TNF haplotypes,
TNF_2 and TNF_4, belonged to the same main branch. The
phylogenetic tree of the BTNL2 blocks showed that BTNL2_5 and _7
(in linkage with HLA-DRB1*03:01 and *13:02, respectively)
and BTNL2_6 (in linkage with HLA-DRB1*13:01) had a different BTNL2
block structure than the rest. HLA-DRB1*12:01 alleles had either
BTNL2 block BTNL2_10 or _11. HLA-DRB1*11:01 and *15:01
had structurally similar BTNL2 blocks, BTNL2_8 and _2,
respectively. HLA-DRB1*01:01, HLA-DRB1*04 alleles and
HLA-DRB1*07:01 and *08:01 formed a wide branch with similar
BTNL2 blocks _3, _4, and _8, _10 and _11 (in linkage with
HLA-DRB1*01:01, *04:01-08, *07:01, *08:01, *11:01, *12:01, *15:01)
and BTNL2_9 (in linkage with HLA-DRB1*13:01).
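
A tree of this kind starts from a pairwise distance matrix between the block haplotypes. The sketch below computes one possible distance, the number of SNP positions at which two TNF blocks differ (Hamming distance), from the alleles listed in Table 4. The choice of Hamming distance and the function names are assumptions made for illustration; the study itself built the trees with the neighbor-joining algorithm in the R package 'ape', and this sketch does not reproduce Figure S2.

```python
from itertools import combinations

# Alleles of the nine TNF blocks at the 13 SNPs, transcribed from Table 4
# (SNP order: rs2009658, rs2239704, rs2229094, rs2229092, rs1041981,
# rs1799964, rs1799724, rs1800629, rs361525, rs3093664, rs769178,
# rs3093553, rs2256965).
TNF_BLOCKS = {
    "TNF_1": "CATACTCGGAGTA",
    "TNF_2": "CCTAATCGGAGTG",
    "TNF_3": "CCTAATCAGAGTG",
    "TNF_4": "CCCACTCGGAGTA",
    "TNF_5": "GCCACCCGGAGTG",
    "TNF_6": "CATACTCGGAGTG",
    "TNF_7": "CATACTTGGATTA",
    "TNF_8": "GCCCCCCGGAGTG",
    "TNF_9": "CCTAATCGGGGTG",
}


def hamming(a, b):
    """Number of positions at which two equal-length haplotypes differ."""
    if len(a) != len(b):
        raise ValueError("haplotypes must cover the same SNPs")
    return sum(x != y for x, y in zip(a, b))


def distance_matrix(blocks):
    """Pairwise Hamming distances, keyed by (block_i, block_j)."""
    return {
        (i, j): hamming(blocks[i], blocks[j])
        for i, j in combinations(blocks, 2)
    }


distances = distance_matrix(TNF_BLOCKS)
```

A matrix like `distances` could then be passed to a neighbor-joining implementation to obtain a tree.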

Discussion 

To our knowledge, no extensive study exists that combines
information from HLA-A, -B, -DRB1, -DQB1 and -DPB1 alleles with
TNF, BTNL2 and complement C4 blocks. In this study, we
addressed (i) the diversity of extended HLA-DRB1 haplotypes
covering the MHC class I, II and III regions, (ii) the shared MHC
or SNP markers in extended HLA-DRB1 haplotypes, and (iii) the
challenge in detecting causal variants in the HLA data.

The most common Finnish A~B~DR haplotype was
A*03~B*35~DRB1*01:01. Other HLA-DRB1*01:01 haplotypes
with variable levels of LD were also observed. According to The
Allele Frequency Net Database [41], A*03~B*35~DRB1*01:01
has been found at a frequency >2% in only a few populations, e.g.
the Swedish Sami [44] and Russia [46].
In Finland, the second most common haplotype was
A*01~B*08~DRB1*03:01, in high LD. This haplotype is less
common in Finns than in other Europeans and has previously been
referred to as the ancestral haplotype AH 8.1 or autoimmune
haplotype [47,48]. Another conserved haplotype with strong LD,
but rare in the Finnish population, was HLA-DRB1*13:02,
reaching from TNF to HLA-DQB1.

Table 6. BTNL2 blocks (BTNL2_1-BTNL2_12) and their frequencies (>1.0%).

SNP         BTNL2_1  BTNL2_2  BTNL2_3  BTNL2_4  BTNL2_5  BTNL2_6  BTNL2_7  BTNL2_8  BTNL2_9  BTNL2_10 BTNL2_11 BTNL2_12
rs28362678  T        C        C        C        C        T        C        C        C        C        C        C
rs2076530   C        T        C        T        T        C        T        T        T        T        T        T
rs9268480   C        C        T        C        C        C        C        C        C        C        C        C
rs2076529   C        T        C        T        T        C        T        T        T        T        T        T
rs3793127   C        C        T        C        C        C        C        C        C        C        C        C
rs28362683  G        G        G        A        G        A        G        G        G        G        G        G
rs3763311   C        C        T        C        C        C        C        C        C        C        C        C
rs3763312   G        G        A        G        G        G        G        G        G        G        G        G
rs3763313   C        A        A        C        A        A        A        A        A        A        A        C
rs3763317   T        C        T        T        C        C        C        C        C        C        C        T
rs5007259   C        T        C        C        T        T        T        T        T        T        T        C
rs17208888  G        G        G        G        G        G        G        A        G        G        G        G
rs9405098   G        G        G        G        G        G        G        G        G        G        G        G
rs9268528   A        A        G        A        A        A        A        G        A        G        G        A
rs9268541   T        T        T        T        T        T        T        T        T        C        C        T
rs2395166   C        C        T        T        T        T        T        T        T        T        T        C
rs3135365   T        G        T        T        T        T        T        T        T        T        T        T
rs3135363   T        T        T        T        C        C        C        C        C        T        T        T
rs3135351   G        G        G        G        T        T        T        G        T        G        G        G
rs3135344   G        G        A        A        A        A        A        A        A        A        A        G
rs3129843   A        A        A        A        G        A        A        A        A        A        A        A
rs3135341   T        G        T        T        T        T        T        T        T        T        T        G
rs2027856   C        C        C        T        C        C        C        C        C        C        C        C
rs3129871   C        A        C        A        C        A        C        C        C        A        A        A
rs9405035   G        G        G        G        G        G        G        G        G        G        G        G
rs9268644   A        A        C        C        A        A        A        C        A        A        C        C
rs3129877   A        G        G        G        A        A        A        G        A        G        G        G
rs3135392   T        G        G        T        T        T        T        G        T        T        T        T
rs3129882   A        G        A        G        A        A        A        G        A        G        G        G
rs8084      C        A        C        A        A        A        A        A        A        A        A        A
rs2239804   A        A        G        A        A        A        A        G        A        A        A        A
rs11544315  G        G        G        G        G        G        G        G        G        G        G        A
rs3177928   A        G        G        G        G        G        G        G        G        G        G        A
rs3135388   C        T        C        C        C        C        C        C        C        C        C        C
rs2213585   T        C        T        C        C        C        C        T        C        C        C        T
rs6937545   C        A        C        C        A        A        A        C        A        C        C        C
rs9268833   C        C        T        C        C        C        C        C        C        C        C        C
rs6919855   T        C        T        T        T        T        T        C        T        T        T        T
rs7766843   C        C        C        T        T        T        T        C        T        T        T        C
rs2395185   G        G        T        G        G        G        G        G        G        G        G        T
rs9268979   T        T        C        C        C        C        C        T        C        C        C        C
rs7748472   A        A        A        A        A        A        G        A        A        A        A        A
f           0.150    0.143    0.137    0.103    0.097    0.053    0.040    0.040    0.020    0.020    0.017    0.013

doi:10.1371/journal.pone.0079690.t006

The enrichment or loss of certain HLA haplotypes (Table 2)
reflects the characteristics of the Finnish population structure,
which has evolved through multiple genetic bottlenecks [6,7].
Detailed information on population substructures was presented by
the 16th International HLA and Immunogenetics Workshop (IHIW)
project "Analysis of HLA Population Data" [8]. Most importantly,
the multidimensional scaling (MDS) of HLA-DRB1 revealed
that the Finns and the Sami are closer to the North-East Asians
than to other European populations [8]. In general, the HLA
variation in Europe follows the North to Southeast axis,
corresponding to previous principal component analysis
(PCA) based results that utilized genome-wide SNP data [7].
Interestingly, in Finland the prevalence of certain HLA alleles
has shown regional differences [40], e.g. HLA-B*35 being highest
in the Eastern parts of Finland. GWASs have also shown similar
trends [7]. The HLA haplotype deviations in relation to the
Finnish population substructure warrant replication studies.
Population stratification is important e.g. for control selection [34].

Table 7. Two-locus haplotypes of HLA-DRB1 with BTNL2 block (>1.0%)#.

HLA-DRB1  BTNL2 block  f
*01:01    BTNL2_1      0.147
*03:01    BTNL2_5      0.097
*04:01    BTNL2_3      0.064
*04:03    BTNL2_3      0.013
*04:08    BTNL2_3      0.020
*07:01    BTNL2_3      0.023
*07:01    BTNL2_12     0.013
*08:01    BTNL2_4      0.103
*11:01    BTNL2_8      0.037
*12:01    BTNL2_10     0.020
*12:01    BTNL2_11     0.016
*13:01    BTNL2_6      0.053
*13:02    BTNL2_7      0.040
*15:01    BTNL2_2      0.143

# Rare haplotypes were excluded.
f = frequency.
doi:10.1371/journal.pone.0079690.t007

We found that the majority of HLA-DRB1 alleles were inherited
as extended blocks from BTNL2 to HLA-DQB1. In spite of the
observed recombination rate in the BTNL2 promoter region, most
DRB1-BTNL2 blocks appeared to be conserved. When HLA-B,
TNF, C4 and HLA-DPB1 were included, the number of extended HLA-DRB1
haplotypes increased strongly. The positions of the MHC
recombination sites vary between populations [49], partly
explaining the non-replication of disease associations between
populations (e.g. [19,50,51]).

Our multi-locus haplotype analysis shows that the extended 
HLA-DRB1 haplotypes can be grouped according to functional 
similarities. Especially interesting for disease association studies are 
the HLA-DRB1 haplotypes not common in the general population. 
Furthermore, TNF and LTA genes are cytokines involved in the 

Table 8. C4 blocks (C4_l- C4_6) and their frequencies 
(>1.0%). 



C4 gene C4_ 7 C4_2 C4_3 C4_4 C4_5 C4_6 

C4A 12 1 Q0 InsCT 2 

C4B 1 Q0 Q0 1 1 1 

f 0.59 0.17 0.09 0.08 0.03 0.03 



doi:1 0.1 371 /journal.pone.0079690.t008 



Table 9. Two-locus haplotypes of HLA-DRB1 with C4 block 
(>1.0%)#. 



HLA-DRB1 


C4 block 


f 


*01 - 01 


2 


0 083 


*01 - 01 


1 


0 032 


*01 - 01 


3 


0 029 


*03"01 


4 


0 068 


*03 - 01 




0 022 


*04 - 01 




0 038 


*04 - 01 




0 026 


*04 - 04 




0 019 


*04 - 08 




0 019 


*07 - 01 




0 046 


*08 - 01 




0 082 


*08:01 




0.041 


*09:01 




0.029 


*11:01 




0.036 


^1 2:01 




0.024 


*13:01 




0.045 


*13:01 


2 


0.032 


*13:02 


5 


0.030 


*15:01 


1 


0.120 


*15:01 


6 


0.016 



tf Rare haplotypes were excluded. 
Q0 = null allele. 
InsCT = insertion in C4A gene. 
f = frequency. 

doi:1 0.1 371 /journal.pone.0079690.t009 



activation of inflammatory processes; hence the HLA-DRB1
haplotypes carrying SNPs related to gene expression (rs2239704,
rs1041981 and rs1800629) [2,52] are plausible candidates for
inflammatory diseases. For example, a rare
HLA-DRB1*15:01~TNF_4 haplotype has a different nucleotide at
rs2239704 than the more frequent HLA-DRB1*15:01~TNF
haplotypes. Interestingly, the same TNF_4 is also found with
HLA-DRB1*01:01, shown to be associated with inflammatory
reactions [53]. The extended HLA-DRB1 haplotypes of
HLA-DRB1*03:01, *08:01, *11:01, *12:01, *13:02 and *15:01
can be grouped according to an exonic missense SNP (rs2076530)
that causes a truncated protein of the T-cell inhibitor BTNL2 [19].
Of the complement C4 proteins, the C4A null alleles were primarily
found with two conserved haplotypes, HLA-DRB1*03:01 (AH 8.1)
and HLA-DRB1*13:02. C4B null alleles were characteristically
inherited with the most common Finnish haplotype
A*03~B*35~DRB1*01:01 or with HLA-DRB1*04:01, *08:01
and *13:01. C4 null alleles have been shown to be related to many
diseases [18]. Taken together, the extended HLA-DRB1 haplotype
analysis can reveal predisposing or protective associations between
markers and disease loci that are not detectable with a single MHC
allele [34].
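
As a rough illustration of how the C4 blocks can be grouped by function, the sketch below sums the block frequencies reported in Table 8 for blocks lacking a functional C4A or C4B gene. Treating the InsCT silencing insertion as equivalent to a C4A null allele is an assumption made here for illustration, and the variable and function names are hypothetical.

```python
# Frequencies of the C4 blocks listed in Table 8 (blocks with f > 1.0%).
BLOCK_FREQ = {
    "C4_1": 0.59, "C4_2": 0.17, "C4_3": 0.09,
    "C4_4": 0.08, "C4_5": 0.03, "C4_6": 0.03,
}

# Blocks without a functional C4A gene. Assumption: the InsCT silencing
# insertion is counted together with the null allele (Q0).
C4A_NOT_EXPRESSED = {"C4_4": "Q0", "C4_5": "InsCT"}

# Blocks carrying a C4B null allele (Q0) according to Table 8.
C4B_NULL = {"C4_2", "C4_3"}


def combined_frequency(blocks):
    """Sum the Table 8 frequencies of the given blocks, rounded to 2 decimals."""
    return round(sum(BLOCK_FREQ[b] for b in blocks), 2)


c4a_deficient = combined_frequency(C4A_NOT_EXPRESSED)
c4b_null = combined_frequency(C4B_NULL)
listed_total = combined_frequency(BLOCK_FREQ)

print(f"Blocks without functional C4A: {c4a_deficient}")
print(f"Blocks with C4B null allele:   {c4b_null}")
print(f"All listed blocks:             {listed_total}")
```

The remainder up to 1.0 corresponds to rare blocks that are not listed in the table.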

Overall, analysis of immunogenomic data is challenging.
Resolving HLA allele ambiguity, phasing and LD calculation
require particular expertise, and the traditional software tools (e.g.
Haploview and PLINK) are not suitable for polymorphic
multi-locus data such as HLA [34,54]. Haplotypes rather than
single markers were used to decrease phasing errors [55]. Due to
the strong LD, multiple SNPs may have corresponding statistical
Table 11. The extended HLA-DRB1 haplotypes with TNF, C4 and BTNL2 blocks (>1%).

HLA-DRB1   TNF block   C4 block   BTNL2 block   f
*01:01     TNF         2          BTNL2_1       0.059
*01:01     TNF_4       3          BTNL2_1       0.026
*01:01     TNF         1          BTNL2_1       0.017
*01:01     TNF_9       2          BTNL2_1       0.012
*03:01     TNF         4          BTNL2_5       0.065
*03:01     TNF         1          BTNL2_5       0.011
*04:01     TNF         1          BTNL2_3       0.024
*04:01     TNF_2                  BTNL2_3       0.022
*04:08     TNF_1       1          BTNL2_3       0.013
*07:01     TNF_1       1          BTNL2_3       0.023
*08:01     TNF         1          BTNL2_4       0.028
*08:01     TNF_2       1          BTNL2_4       0.022
*08:01     TNF_2                  BTNL2_4       0.022
*08:01     TNF_1                  BTNL2_4       0.016
*11:01     TNF_5                  BTNL2_8       0.011
*13:01     TNF_2                  BTNL2_6       0.014
*13:01     TNF_1                  BTNL2_6       0.012
*13:02     TNF_9                  BTNL2_7       0.027
*15:01     TNF_1                  BTNL2_2       0.059
*15:01     TNF_7                  BTNL2_2       0.016
*15:01     TNF_5                  BTNL2_2       0.012

f = haplotype frequency.

doi:10.1371/journal.pone.0079690.t011



proof of association, making the search for possible causal variants
exceptionally difficult. To illustrate the complexity, the known
tag-SNP for HLA-DRB1*15:01 (rs3135388) (see Figure S3) was shown
to have at least 20 proxy SNPs (r² > 0.9) with variable function and
in different gene regions [14,22,38]. The allele frequency of a SNP
might also be population specific (rs2213585; Table S4), and thus
tag-SNPs (i.e. imputation) should not be used unless the
ethnic background is known [56]. In this material, except for the
tag-SNP for HLA-DRB1*15:01 (rs3135388) [14,22], we did not
detect any single tag-SNP for a specific HLA-DRB1 allele. Indeed,
large population-specific cohorts and dense SNP genotyping are
needed for detecting HLA tag-SNPs.
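
The r² criterion used for proxy and tag-SNPs can be sketched as follows. This is a minimal illustration of the standard pairwise r² statistic computed from phased haplotypes, not the software used in this study. The multi-allelic HLA locus is reduced to presence or absence of one allele, and the toy haplotypes and the function name `r_squared` are assumptions.

```python
def r_squared(haplotypes, allele_a, allele_b):
    """Pairwise r^2 between allele_a at locus 1 and allele_b at locus 2.

    haplotypes: list of (locus1_allele, locus2_allele) tuples,
    one tuple per phased chromosome.
    """
    n = len(haplotypes)
    p_a = sum(1 for a, _ in haplotypes if a == allele_a) / n
    p_b = sum(1 for _, b in haplotypes if b == allele_b) / n
    p_ab = sum(1 for a, b in haplotypes if a == allele_a and b == allele_b) / n
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when an allele is fixed or absent")
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    return d * d / denom


# Hypothetical toy data: (HLA-DRB1 allele, SNP allele) per chromosome.
perfect_tag = [("*15:01", "A")] * 4 + [("other", "G")] * 16
imperfect_tag = [("*15:01", "A")] * 4 + [("other", "A")] + [("other", "G")] * 15

r2_perfect = r_squared(perfect_tag, "*15:01", "A")
r2_imperfect = r_squared(imperfect_tag, "*15:01", "A")
print(f"perfect tag:   r^2 = {r2_perfect:.2f}")
print(f"imperfect tag: r^2 = {r2_imperfect:.2f}")
```

In the second toy set a single chromosome carrying the SNP allele without the HLA allele is enough to drop r² below the 0.9 threshold, which illustrates why tag-SNPs validated in one population may not transfer to another.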

We acknowledge that multiple ambiguous alleles, a limited
sample size and rare HLA alleles can influence haplotype
phasing and LD, leading to false positive results [34]. With a
small sample size, the study of rare MHC haplotypes is
challenging. Thus, we presented only frequent MHC haplotypes
(>1%) and interpreted the LD between markers carefully
[34,35,54,55,57]. The HLA allele distributions were consistent
with the previously published Finnish registry studies [39,40],
suggesting that the Finnish HLA profile can be estimated with a
sample set of 150 individuals.
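
To give a sense of the uncertainty near the 1% cut-off, the sketch below computes a Wilson score interval for a haplotype observed on 3 of the 300 chromosomes carried by 150 individuals. The Wilson interval is a technique chosen here for illustration and was not part of the analysis in this study. The function name and the example count are assumptions.

```python
import math


def wilson_interval(count, n, z=1.96):
    """Wilson score interval for a proportion count/n (default ~95%)."""
    if n <= 0 or not 0 <= count <= n:
        raise ValueError("require n > 0 and 0 <= count <= n")
    p = count / n
    z2 = z * z
    denom = 1 + z2 / n
    centre = (p + z2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z2 / (4 * n * n)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


n_chromosomes = 2 * 150  # two chromosomes per individual
observed = 3             # hypothetical count, i.e. a frequency of exactly 1%

low, high = wilson_interval(observed, n_chromosomes)
print(f"f = {observed / n_chromosomes:.3f}, "
      f"approx. 95% interval {low:.4f} to {high:.4f}")
```

The interval is wide relative to the point estimate, which is consistent with restricting the reported haplotypes to those above 1% and interpreting them with care.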

One of the limitations of this study was the lack of genome-wide
SNP data. Hence, we were not able to use HLA*IMP [58] for
allele imputation or to analyse the HLA tagging SNPs in the Finnish
population [22]. Because of the differences between populations,
population-specific validation is highly recommended before using
either HLA*IMP or the HLA tagging SNPs [7,14,23].

Taken together, we stress the importance of understanding
population-specific MHC haplotypes and of the analysis of
immunogenetic data. The study of extended HLA-DRB1 haplotypes
indicates the functionality of the implicated genes and provides
hypotheses for further assessment of HLA-DRB1. The results
presented here will assist disease association studies focusing on
chronic inflammatory, autoimmune and infectious diseases.

Supporting Information 

Figure S1 The LD (r²) structure of TNF and BTNL2
blocks using SNPs.

(TIF) 

Figure S2 Phylogenetic trees based on the genetic 
distance of TNF and BTNL2 blocks with bootstrap 
values. 

(TIF) 

Figure S3 A known tag-SNP for HLA-DRB1*15:01 is in
strong LD with many other SNPs in the MHC Class III
region. The known proxies were taken from HapMap [28]
using the SNAP software [38].
(TIF)

Table S1 The two-locus haplotypes with frequency
>1%.

(DOC) 

Table S2 The observed HLA-DRB1 alleles (%) with HLA-A,
-B, -DQB1 and -DPB1 alleles and TNF, C4 and BTNL2
blocks.

(DOC) 

Table S3 In the database (HapMap or 1000 Genomes)
there can be found several proxy SNPs (r² > 0.9) for the
genotyped SNPs.

(DOC) 

Table S4 A summary of the accepted SNPs (n = 55). The
Finnish allele frequencies were compared with HapMap project
(CEU) data [28].

(DOC) 

Table S5 The HLA-DRB1 haplotypes with HLA-B alleles, 
TNF and BTNL2 blocks and C4 allotypes (>1%). 

(DOC) 

Acknowledgments 

The authors thank the subjects for their participation and the clinicians for
their time and efforts to make this study possible. We also thank Leena
Saraste and Mari Kaunisto for reviewing the English language and Minna
Veini, Kukka Aimonen, Kaisa Roine, Hanna Vauhkonen, Anil Palikhe,
Krista Salli, Jagoda Lasota, Minna Haanpaa and Mari Kaunisto for their
contributions to the gene analyses.

Author Contributions 

Conceived and designed the experiments: AW EV LEL MLL. Performed 
the experiments: AW EV LEL MLL. Analyzed the data: AW EV LEL 
KTE RP MLL. Contributed reagents/materials/analysis tools: AW EV 
KTE MS. Wrote the paper: AW EV LEL RP KTE MS MLL. 



PLOS ONE | www.plosone.org 



11 



November 2013 | Volume 8 | Issue 11 | e79690 



Diversity of Extended HLA-DRB1 Haplotypes 



References 

1. Horton R, Wilming L, Rand V, Lovering RC, Bruford EA, et al. (2004) Gene
map of the extended human MHC. Nat Rev Genet 5: 889-899. 10.1038/nrg1489.

2. Knight JC, Keating BJ, Kwiatkowski DP (2004) Allele-specific repression of
lymphotoxin-alpha by activated B cell factor-1. Nat Genet 36: 394-399.
10.1038/ng1331.

3. Nguyen T, Liu XK, Zhang Y, Dong C (2006) BTNL2, a butyrophilin-like
molecule that functions to inhibit T cell activation. J Immunol 176: 7354-7360.

4. Jeffreys AJ, Kauppi L, Neumann R (2001) Intensely punctate meiotic
recombination in the class II region of the major histocompatibility complex.
Nat Genet 29: 217-222. 10.1038/ng1001-217.

5. Kauppi L, Sajantila A, Jeffreys AJ (2003) Recombination hotspots rather than
population history dominate linkage disequilibrium in the MHC class II region.
Hum Mol Genet 12: 33-40.

6. Jakkula E, Rehnstrom K, Varilo T, Pietilainen OP, Paunio T, et al. (2008) The
genome-wide patterns of variation expose significant substructure in a founder
population. Am J Hum Genet 83: 787-794. 10.1016/j.ajhg.2008.11.005.

7. Nelis M, Esko T, Magi R, Zimprich F, Zimprich A, et al. (2009) Genetic
structure of europeans: A view from the north-east. PLoS One 4: e5472.
10.1371/journal.pone.0005472.

8. Riccio ME, Buhler S, Nunes JM, Vangenot C, Cuenod M, et al. (2013) 16(th)
IHIW: Analysis of HLA population data, with updated results for 1996 to 2012
workshop data (AHPD project report). Int J Immunogenet 40: 21-30.
10.1111/iji.12033.

9. Lee SJ, Klein J, Haagenson M, Baxter-Lowe LA, Confer DL, et al. (2007)
High-resolution donor-recipient HLA matching contributes to the success of
unrelated donor marrow transplantation. Blood 110: 4576-4583.
10.1182/blood-2007-06-097386.

10. Kawase T, Morishima Y, Matsuo K, Kashiwase K, Inoko H, et al. (2007)
High-risk HLA allele mismatch combinations responsible for severe acute
graft-versus-host disease and implication for its molecular mechanism.
Blood 110: 2235-2241. 10.1182/blood-2007-02-072405.

11. Petersdorf EW, Malkki M, Horowitz MM, Spellman SR, Haagenson MD, et al.
(2013) Mapping MHC haplotype effects in unrelated donor hematopoietic cell
transplantation. Blood 121: 1896-1905. 10.1182/blood-2012-11-465161.

12. Palikhe A, Sinisalo J, Seppanen M, Valtonen V, Nieminen MS, et al. (2007)
Human MHC region harbors both susceptibility and protective haplotypes for
coronary artery disease. Tissue Antigens 69: 47-55.
10.1111/j.1399-0039.2006.00735.x.

13. Kallio-Laine K, Seppanen M, Aittoniemi J, Kautiainen H, Seppala I, et al.
(2010) HLA-DRB1*01 allele and low plasma immunoglobulin G1 concentration
may predispose to herpes-associated recurrent lymphocytic meningitis. Hum
Immunol 71: 179-181. 10.1016/j.humimm.2009.10.014.

14. de Bakker PI, Raychaudhuri S (2012) Interrogating the major histocompatibility
complex with high-throughput genomics. Hum Mol Genet 21: R29-36.
10.1093/hmg/dds384.

15. Gough SC, Simmonds MJ (2007) The HLA region and autoimmune disease:
Associations and mechanisms of action. Curr Genomics 8: 453-465.
10.2174/138920207783591690.

16. Kainulainen L, Peltola V, Seppanen M, Viander M, He Q, et al. (2012) C4A
deficiency in children and adolescents with recurrent respiratory infections.
Hum Immunol 73: 498-501. 10.1016/j.humimm.2012.02.015.

17. Seppanen M, Suvilehto J, Lokki ML, Notkola IL, Jarvinen A, et al. (2006)
Immunoglobulins and complement factor C4 in adult rhinosinusitis. Clin Exp
Immunol 145: 219-227. 10.1111/j.1365-2249.2006.03134.x.

18. Samano ES, Ribeiro Lde M, Gorescu RG, Rocha KG, Grumach AS (2004)
Involvement of C4 allotypes in the pathogenesis of human diseases. Rev Hosp
Clin Fac Med Sao Paulo 59: 138-144. /S0041-87812004000300009.

19. Valentonyte R, Hampe J, Huse K, Rosenstiel P, Albrecht M, et al. (2005)
Sarcoidosis is associated with a truncating splice site mutation in BTNL2. Nat
Genet 37: 357-364. 10.1038/ng1519.

20. Mitsunaga S, Hosomichi K, Okudaira Y, Nakaoka H, Kunii N, et al. (2013)
Exome sequencing identifies novel rheumatoid arthritis-susceptible variants in
the BTNL2. J Hum Genet. 10.1038/jhg.2013.2.

21. Koskinen L, Romanos J, Kaukinen K, Mustalahti K, Korponay-Szabo I, et al.
(2009) Cost-effective HLA typing with tagging SNPs predicts celiac disease risk
haplotypes in the finnish, hungarian, and italian populations. Immunogenetics
61: 247-256. 10.1007/s00251-009-0361-3.

22. de Bakker PI, McVean G, Sabeti PC, Miretti MM, Green T, et al. (2006) A
high-resolution HLA and SNP haplotype map for disease association studies in
the extended human MHC. Nat Genet 38: 1166-1172. 10.1038/ng1885.

23. Monsuur AJ, de Bakker PI, Zhernakova A, Pinto D, Verduijn W, et al. (2008)
Effective detection of human leukocyte antigen risk alleles in celiac disease using
tag single nucleotide polymorphisms. PLoS One 3: e2270.
10.1371/journal.pone.0002270.

24. Buhler S, Nunes JM, Nicoloso G, Tiercy JM, Sanchez-Mazas A (2012) The
heterogeneous HLA genetic makeup of the Swiss population. PLoS One 7:
e41400. 10.1371/journal.pone.0041400.



25. Mack SJ, Tu B, Lazaro A, Yang R, Lancaster AK, et al. (2009) HLA-A, -B, -C,
and -DRB1 allele and haplotype frequencies distinguish eastern european
americans from the general european american population. Tissue Antigens 73:
17-32. 10.1111/j.1399-0039.2008.01151.x.

26. Robinson J, Halliwell JA, McWilliam H, Lopez R, Parham P, et al. (2013) The
IMGT/HLA database. Nucleic Acids Res 41: D1222-7. 10.1093/nar/gks949.

27. Laaksonen M, Pastinen T, Sjoroos M, Kuokkanen S, Ruutiainen J, et al. (2002)
HLA class II associated risk and protection against multiple sclerosis-a finnish
family study. J Neuroimmunol 122: 140-145.

28. International HapMap Consortium (2003) The international HapMap project.
Nature 426: 789-796. 10.1038/nature02168.

29. Paakkanen R, Vauhkonen H, Eronen KT, Jarvinen A, Seppanen M, et al.
(2012) Copy number analysis of complement C4A, C4B and C4A silencing
mutation by real-time quantitative polymerase chain reaction. PLoS One 7:
e38813. 10.1371/journal.pone.0038813.

30. Barrett JC, Fry B, Maller J, Daly MJ (2005) Haploview: Analysis and
visualization of LD and haplotype maps. Bioinformatics 21: 263-265.
10.1093/bioinformatics/bth457.

31. Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): An integrated
software package for population genetics data analysis. Evol Bioinform Online 1:
47-50.

32. Stephens M, Smith NJ, Donnelly P (2001) A new statistical method for haplotype
reconstruction from population data. Am J Hum Genet 68: 978-989.
10.1086/319501.

33. Mack SJ, Tu B, Yang R, Masaberg C, Ng J, et al. (2011) Human leukocyte
antigen-A, -B, -C, -DRB1 allele and haplotype frequencies in americans
originating from southern europe: Contrasting patterns of population
differentiation between italian and Spanish americans. Hum Immunol 72: 144-149.
10.1016/j.humimm.2010.10.017.

34. Hollenbach JA, Mack SJ, Gourraud PA, Single RM, Maiers M, et al. (2011) A
community standard for immunogenomic data reporting and analysis: Proposal
for a STrengthening the REporting of immunogenomic studies statement.
Tissue Antigens 78: 333-344. 10.1111/j.1399-0039.2011.01777.x.

35. Pan Q, Fan S, Wang X, Zhao X, Pan M, et al. (2011) The distribution of HLA-A,
-B, and -DRB1 alleles and haplotypes in inhabitants of guizhou province of
china. J Biomed Res 25: 328-334. 10.1016/S1674-8301(11)60044-4.

36. Zapata C (2000) The D' measure of overall gametic disequilibrium between
pairs of multiallelic loci. Evolution 54: 1809-1812.

37. Paradis E, Claude J, Strimmer K (2004) APE: Analyses of phylogenetics and
evolution in R language. Bioinformatics 20: 289-290.

38. Johnson AD, Handsaker RE, Pulit SL, Nizzari MM, O'Donnell CJ, et al. (2008)
SNAP: A web-based tool for identification and annotation of proxy SNPs using
HapMap. Bioinformatics 24: 2938-2939. 10.1093/bioinformatics/btn564.

39. Haimila K, Penttila A, Arvola A, Auvinen MK, Korhonen M (2013) Analysis of
the adequate size of a cord blood bank and comparison of HLA haplotype
distributions between four populations. Hum Immunol 74: 189-195.
10.1016/j.humimm.2012.10.018.

40. Siren MK, Sareneva H, Lokki ML, Koskimies S (1996) Unique HLA antigen
frequencies in the finnish population. Tissue Antigens 48: 703-707.

41. Gonzalez-Galarza FF, Christmas S, Middleton D, Jones AR (2011) Allele
frequency net: A database and online repository for immune gene frequencies in
worldwide populations. Nucleic Acids Res 39: D913-9. 10.1093/nar/gkq1128.

42. Williams F, Meenagh A, Maxwell AP, Middleton D (1999) Allele resolution of
HLA-A using oligonucleotide probes in a two-stage typing strategy. Tissue
Antigens 54: 59-68.

43. Middleton D, Williams F, Hamill MA, Meenagh A (2000) Frequency of HLA-B
alleles in a caucasoid population determined by a two-stage PCR-SSOP typing
strategy. Hum Immunol 61: 1285-1297.

44. Johansson A, Ingman M, Mack SJ, Erlich H, Gyllensten U (2008) Genetic origin
of the Swedish sami inferred from HLA class I and class II allele frequencies.
Eur J Hum Genet 16: 1341-1349. 10.1038/ejhg.2008.88.

45. Papassavas EC, Spyropoulou-Vlachou M, Papassavas AC, Schipper RF,
Doxiadis IN, et al. (2000) MHC class I and class II phenotype, gene, and
haplotype frequencies in greeks using molecular typing data. Hum Immunol 61:
615-623.

46. Arnaiz-Villena A, Martinez-Laso J, Moscoso J, Livshits G, Zamora J, et al.
(2003) HLA genes in the chuvashian population from european russia:
Admixture of central european and mediterranean populations. Hum Biol 75:
375-392.

47. Yunis EJ, Larsen CE, Fernandez-Vina M, Awdeh ZL, Romero T, et al. (2003)
Inheritable variable sizes of DNA stretches in the human MHC: Conserved
extended haplotypes and their fragments or blocks. Tissue Antigens 62: 1-20.






48. Price P, Witt C, Allcock R, Sayer D, Garlepp M, et al. (1999) The genetic basis
for the association of the 8.1 ancestral haplotype (A1, B8, DR3) with multiple
immunopathological diseases. Immunol Rev 167: 257-274.

49. Lam TH, Shen M, Chia JM, Chan SH, Ren EC (2013) Population-specific
recombination sites within the human MHC region. Heredity (Edinb).
10.1038/hdy.2013.27.

50. Rybicki BA, Walewski JL, Maliarik MJ, Kian H, Iannuzzi MC, et al. (2005) The
BTNL2 gene and sarcoidosis susceptibility in african americans and whites.
Am J Hum Genet 77: 491-499. 10.1086/444435.

51. Spagnolo P, Sato H, Grutters JC, Renzoni EA, Marshall SE, et al. (2007)
Analysis of BTNL2 genetic polymorphisms in british and dutch patients with
sarcoidosis. Tissue Antigens 70: 219-227. 10.1111/j.1399-0039.2007.00879.x.

52. Ozaki K, Ohnishi Y, Iida A, Sekine A, Yamada R, et al. (2002) Functional SNPs
in the lymphotoxin-alpha gene that are associated with susceptibility to
myocardial infarction. Nat Genet 32: 650-654. 10.1038/ng1047.

53. Sane J, Kurkela S, Desdouits M, Kalimo H, Mazalrey S, et al. (2012) Prolonged
myalgia in sindbis virus infection: Case description and in vitro infection of
myotubes and myoblasts. J Infect Dis 206: 407-414. 10.1093/infdis/jis358.

54. Cano P, Klitz W, Mack SJ, Maiers M, Marsh SG, et al. (2007) Common and
well-documented HLA alleles: Report of the ad-hoc committee of the american
society for histocompatibility and immunogenetics. Hum Immunol 68: 392-417.
10.1016/j.humimm.2007.01.014.

55. Furst D, Zollikofer C, Schrezenmeier H, Mytilineos J (2012) TNFA promoter
alleles-frequencies and linkage with classical HLA genes in a south german
Caucasian population. Tissue Antigens 80: 502-508. 10.1111/tan.12025.

56. Valente FP, Tan CR, Temple SE, Phipps M, Witt CS, et al. (2009) The
evolution and diversity of TNF block haplotypes in european, asian and
australian aboriginal populations. Genes Immun 10: 607-615.
10.1038/gene.2009.45.

57. Gourraud PA, Meenagh A, Cambon-Thomsen A, Middleton D (2010) Linkage
disequilibrium organization of the human KIR superlocus: Implications for KIR
data analyses. Immunogenetics 62: 729-740. 10.1007/s00251-010-0478-4.

58. Dilthey AT, Moutsianas L, Leslie S, McVean G (2011) HLA*IMP-an integrated
framework for imputing classical HLA alleles from SNP genotypes.
Bioinformatics 27: 968-972. 10.1093/bioinformatics/btr061.






